Visual Subpopulation Discovery and Validation in Cohort Study Data
نویسندگان
چکیده
Epidemiology aims at identifying subpopulations of cohort participants that share common characteristics (e.g. alcohol consumption) to explain risk factors of diseases in cohort study data. These data contain information about the participants’ health status gathered from questionnaires, medical examinations, and image acquisition. Due to the growing volume and heterogeneity of epidemiological data, the discovery of meaningful subpopulations is challenging. Subspace clustering can be leveraged to find subpopulations in large and heterogeneous cohort study datasets. In our collaboration with epidemiologists, we realized their need for a tool to validate discovered subpopulations. For this purpose, identified subpopulations should be searched for independent cohorts to check whether the findings apply there as well. In this paper we describe our interactive Visual Analytics framework S-ADVIsED for SubpopulAtion Discovery and Validation In Epidemiological Data. S-ADVIsED enables epidemiologists to explore and validate findings derived from subspace clustering. We provide a coordinated multiple view system, which includes a summary view of all subpopulations, detail views, and statistical information. Furthermore, intervals for variables involved in a subspace cluster can be adjusted. This extension was suggested by epidemiologists. We investigated the replication of a selected subpopulation with multiple variables in another population by considering different measurements. As a specific result, we observed that study participants exhibiting high liver fat accumulation deviate strongly from other subpopulations and from the total study population with respect to age, body mass index, thyroid volume and thyroid-stimulating hormone.
منابع مشابه
Construct Validation of the Health Literacy Questionnaire (HLQ) in Shahrekord Cohort Study, Iran
Background: Health literacy promotion is considered to be an important goal in the healthcare strategic planning of every country. The present study aimed to evaluate the validity and reliability of the health literacy questionnaire (HLQ) in the participants of Shahrekord cohort study, Iran. Methods: This cross-sectional study was conducted on 400 respondents who were selected via systematic,...
متن کاملAddressing genetic tumor heterogeneity through computationally predictive combination therapy.
UNLABELLED Recent tumor sequencing data suggest an urgent need to develop a methodology to directly address intratumoral heterogeneity in the design of anticancer treatment regimens. We use RNA interference to model heterogeneous tumors, and demonstrate successful validation of computational predictions for how optimized drug combinations can yield superior effects on these tumors both in vitro...
متن کاملDesigning an Ontology for Knowledge Discovery in Iran’s Vaccine
Ontology is a requirement engineering product and the key to knowledge discovery. It includes the terminology to describe a set of facts, assumptions, and relations with which the detailed meanings of vocabularies among communities can be determined. This is a qualitative content analysis research. This study has made use of ontology for the first time to discover the knowledge of vaccine in Ir...
متن کاملMTHFR Glu429Ala and ERCC5 His46His Polymorphisms Are Associated with Prognosis in Colorectal Cancer Patients: Analysis of Two Independent Cohorts from Newfoundland
INTRODUCTION In this study, 27 genetic polymorphisms that were previously reported to be associated with clinical outcomes in colorectal cancer patients were investigated in relation to overall survival (OS) and disease free survival (DFS) in colorectal cancer patients from Newfoundland. METHODS The discovery and validation cohorts comprised of 532 and 252 patients, respectively. Genotypes of...
متن کاملPerformance comparison of four commercial GE discovery PET/CT scanners: A monte carlo study using GATE
Combined PET/CT scanners now play a major role in medicine for in vivo imaging in oncology, cardiology, neurology, and psychiatry. As the performance of a scanner depends not only on the scintillating material but also on the scanner design, with regards to the advent of newer scanners, there is a need to optimize acquisition protocols as well as to compare scanner ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1711.09377 شماره
صفحات -
تاریخ انتشار 2017